18 research outputs found

What you say and how you say it : joint modeling of topics and discourse in microblog conversations

Authors: Gao Cuiyun, He Yulan, King Irwin, Li Jing, Lyu Michael, Zeng Jichuan
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 18/03/2019
This paper presents an unsupervised framework for jointly modeling topic content and discourse behavior in microblog conversations. Concretely, we propose a neural model to discover word clusters indicating what a conversation concerns (i.e., topics) and those reflecting how participants voice their opinions (i.e., discourse). Extensive experiments show that our model can yield both coherent topics and meaningful discourse behavior. Further study shows that our topic and discourse representations can benefit the classification of microblog messages, especially when they are jointly trained with the classifier.

arXiv.org e-Print Archive

Warwick Research Archives Portal Repository

Code Structure Guided Transformer for Source Code Summarization

Authors: Gao Cuiyun, Gao Shuzheng, He Yulan, Lyu Michael R., Nie Lun Yiu, Xia Xin, Zeng Jichuan
Publication date: 22/07/2022
Code summaries help developers comprehend programs and reduce the time needed to infer program functionality during software maintenance. Recent efforts resort to deep learning techniques such as sequence-to-sequence models for generating accurate code summaries, among which Transformer-based approaches have achieved promising performance. However, effectively integrating code structure information into the Transformer is under-explored in this task domain. In this paper, we propose a novel approach named SG-Trans to incorporate code structural properties into the Transformer. Specifically, we inject the local symbolic information (e.g., code tokens and statements) and global syntactic structure (e.g., data flow graph) into the self-attention module of the Transformer as inductive bias. To further capture the hierarchical characteristics of code, the local information and global structure are distributed across the attention heads of the lower and higher layers of the Transformer, respectively. Extensive evaluation shows the superior performance of SG-Trans over the state-of-the-art approaches. Compared with the best-performing baseline, SG-Trans improves the METEOR score, a metric widely used for measuring generation quality, by 1.4% and 2.0% on two benchmark datasets, respectively.

arXiv.org e-Print Archive

INFAR: insight extraction from app reviews

Authors: GAO Cuiyun, KING Irwin, LIN Chin-Yew, LO David, LYU Michael R., ZENG Jichuan
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/11/2018
Crossref

Institutional Knowledge at Singapore Management University

Emerging app issue identification from user feedback: Experience on WeChat

Authors: DENG Yuetang, GAO Cuiyun, KING Irwin, LO David, LYU Michael R., ZENG Jichuan, ZHENG Wujie
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/05/2019
Crossref

Institutional Knowledge at Singapore Management University